Rough Hypercuboid Based Supervised Regularized Canonical Correlation for Multimodal Data Analysis
نویسندگان
چکیده
One of the main problems in real life omics data analysis is how to extract relevant and non-redundant features from high dimensional multimodal data sets. In general, supervised regularized canonical correlation analysis (SRCCA) plays an important role in extracting new features from multimodal omics data sets. However, the existing SRCCA optimizes regularization parameters based on the quality of first pair of canonical variables only using standard feature evaluation indices. In this regard, this paper introduces a new SRCCA algorithm, integrating judiciously the merits of SRCCA and rough hypercuboid approach, to extract relevant and nonredundant features in approximation spaces from multimodal omics data sets. The proposed method optimizes regularization parameters of the SRCCA based on the quality of a set of pairs of canonical variables using rough hypercuboid approach. While the rough hypercuboid approach provides an efficient way to calculate the degree of dependency of class labels on feature set in approximation spaces, the merit of SRCCA helps in extracting non-redundant features from multimodal data sets. The effectiveness of the proposed approach, along with a comparison with related existing approaches, is demonstrated on several real life data sets.
منابع مشابه
Appendix: Multimodal Omics Data Integration Using Max Relevance-Max Significance Criterion
This paper presents a novel supervised regularized canonical correlation analysis, termed as CuRSaR, to extract relevant and significant features from multimodal high dimensional omics data sets [1]. The proposed method extracts a new set of features from two multidimensional data sets by maximizing the relevance of extracted features with respect to sample categories and significance among the...
متن کاملSemi-supervised Laplacian Regularization of Kernel Canonical Correlation Analysis
Kernel canonical correlation analysis (KCCA) is a dimensionality reduction technique for paired data. By finding directions that maximize correlation, KCCA learns representations that are more closely tied to the underlying semantics of the data rather than noise. However, meaningful directions are not only those that have high correlation to another modality, but also those that capture the ma...
متن کاملAsymmetrically Weighted CCA And Hierarchical Kernel Sentence Embedding For Multimodal Retrieval
Joint modeling of language and vision has been drawing increasing interest. A multimodal data representation allowing for bidirectional retrieval of images by sentences and vice versa is a key aspect of this modeling. In this paper we show that a cross-view mapping of the search space to the query space achieves state of the art performance in bidirectional retrieval using off the shelf feature...
متن کاملImproving Cancer Classification Accuracy Mistreatment Principle Element Analysis Methodology
Cancer classification enables the definition of therapeutic groups, for which therapeutic protocols can be elaborated, taking into account all treatment possibilities. Most classifications are based on clinical data. Most of the tumors have similar appearance so histological analysis tends to be unreliable. The advances in microarray technology make individualized treatment possible and when th...
متن کاملA Supervised Combined Feature Extraction Method for Recognition
Multimodal recognition is an emerging technique to overcome the non-robustness of the unimodal recognition in real applications. Canonical correlation analysis (CCA) has been employed as a powerful tool for feature fusion in the realization of such multimodal system. However, CCA is the unsupervised feature extraction and it does not utilize the class information of the samples, resulting in th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Fundam. Inform.
دوره 148 شماره
صفحات -
تاریخ انتشار 2016